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® Antigenicaily active proteins and peptides, and transmissible gastro-entertHs virus TGEV vaccines. 

© The invention relates to new antigenic proteins 
which can be used to stimulate the immunity of pigs 
against transmissible gastro-enteritis virus (TGEV)- 
vaccines, and to fragments or derivatives thereof. 
The invention also relates to vaccines which com- 
prise such antigenic proteins or peptides. 

The nucleotide sequence and amine acid se- 
quence of a gene coding for protein of an TGEV- 
strain are indicated in figure 1. 
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New antlgenicaily active proteins and peptides, and transmissible gastro-enteritis virus (TGEV)- 

vaccines. 



The invention relates to a new antigenic protein 
which is capable of stimulating the immunity of 
pigs against transmissible gastro-enteritis virus 
(TGEV), and to fragments or derivatives thereof. 
The invention further relates to vaccines which 
comprises such an antigenic protein or peptide. 

Transmissible gastro-enteritis is an infection 
disease of the intestine in pigs causing high mortal- 
ity in piglings younger than 2 weeks and hence 
great economic losses to the pig breeders. Insprte 
of the existance of various types of vaccines, the 
problem of TGE has not been solved effectively. 
Therefore, the development of new, better vaccines 
against TGE is urgently necessary. 

The cause of TGE is a virus which belongs to 
the group or corona viruses. Corona viruses are 
positive single-stranded RNA viruses having a lipid 
envelope. The TGEV genome RNA has a length of 
approximately 23,600 bases. The virus particle 
(virion) comprises three important proteins, namely 
the peplomeric protein (S), the nucleocapside pro- 
tein (N), and the matrix protein (M). 

The characteristic protrusions at the surface of 
the virion are formed by the peplomeric protein (S). 
Each peplomer is formed by a glycopolypeptide. 
The protein is fixed in the membrane by means of 
a terminal membrane anchor sequence. It is known 
from literature that antibodies generated against the 
peplomeric protein of the said virus, provide pro- 
tection to the pig. 

According to the present invention the 
nucleotide sequence and the amino add sequence 
of the TGEV Purdue strain were determined. 

The invention therefore relates to proteins and 
fragments thereof which comprise at least one of 
the areas which play a part in the protection 
against TGE infections. 

Theoretically, such fragments can be used as 
such as an antigen for vaccinating against TGE 
However, it is to be preferred to couple said frag- 
ments to a suitable carrier. 

The peptide fragments or proteins according to 
the invention which comprise at least one antigeni- 
cally active peptide fragment can be prepared ac- 
cording to methods known for the preparation of 
peptides and proteins. 

First of all, the peptides can be prepared syn- 
thetically by means of known techniques starting 
from the individual amino adds or smaller peptide 
fragments. 

The peptides can also be obtained biosyntheti- 
cally using recombinant DNA techniques and ex- 
pression systems, for example, by: 



a) transformation of host cells with an ex- 
pression vector which comprises a DNA, coding for 
an (the) antigenic determinants) (peptides in gen- 
eral); 

s b) expressing the genome inserted in the 

expression vector 

c) harvesting the cell culture, and 

d) separating the synthesised peptide 
(protein). 

10 The Invention therefore also relates to a meth- 
od of preparing a DNA molecule which codes for 
peptides according to the invention. Such a method 
comprises the folllowing steps: 

a) isloation of TGE single-stranded RNA; 

is b) synthesis of a cDNA strand complemen- 

tary to the RNA strand mentioned in a); and 

c) removal of the RNA molecule and synthe- 
sis of a second cDNA strand, using the first cDNA 
strand as a template. 

20 The double-stranded DNA obtained in this 
manner may be incorporated in an expression vec- 
tor in known manner so that a recombinant expres- 
sion vector is formed. This vector may be intro- 
duced into a suitable host cell, for example, by 

25 transformation. 

The invention will now be described in greater 
detail with reference to the ensuing specific exam- 
ple. 

30 

EXAMPLE 

a) Virus and tissue culture. 

35 TGEV strain Purdue was purified twice by 
means of pricking of a plaque on PD5 cells. Culture 
conditions were described previously in L Jacobs 
et al, J. Virol. 57; (1986); 1010-15. 

40 

b) Isolation of subgenomic mRNA's 

PD5 cells (9 x 10 7 ) were infected with an infec- 
tion multiplicity of 15. Seven hours after infection 

45 actinomycine D was added and 10 hours after 
infection the cells well lysed and the RNA ex- 
tracted, as described previously by W.J.M. Spaan 
et ah, Virol. 108 (1981); 424-434. The RNA (560 
ug) was dissolved in a buffer having a high salt 

so concentration (10 mM Tris-HCI pH 7.5, 0.05% SDS, 
500 mM Nad) and transferred to an oligo(dT> 
ceilulose column for a selection on poly (A) + RNA. 
After elution with 10 mM Tris-HCI pH 7.5the RNA 
was concentrated by precipitation with ethanol and 
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fractionated by means of an isokinetic sucrose gra- 
dient (B.A.NI. van der Zeijst et ai M in G.D. Fasman 
(Ed.). Handbook of biochemistry and molecular bi- 
ology, 3rd ed. Physical and Chemical data. Vol. ^ 
CRC Press Inc., Boca Raton, Fla. (1976), gg. 428^ 
519). Before providing the RNA on the gradient rt 
was dissolved in 50 mM Tris-HCI pH 7.4, 10 mM 
LiCI. 1% SDS and heated at 56°C for 1 minute, 
^-labelled TGEV mRNA's were added as mark- 
ers. The identification and translation of TGEV 
mRNA's is described by Jacobs et al., 1986. 



c) Cloning of TGEV peplomer (E2) gene. 

RNA3 obtained after purification over the su- 
crose gradient was used for the synthesis of cONA 
(carried out as described by Gubier and Hoffman 
(Gene, 25 (1983), 263-269). Caff-thymus pentamers 
were used as primers and methyl mercury hydrox- 
ide for denaturing the RNA. The double-stranded 
cDNA was elongated with dC residues and then 
cloned in a pUC9 vector elongated with dG resi- 
dues. The DNA was used for the transformation of 
E. coli strain JM109 as described by D. Hanahan et 
al., J. Mol. Biol., 166, (1983), 557-580. In this 
manner a cDNA library of approximately 900 trans- 
formants was obtained. The plasmides comprise 
insertions having a length up to 5kb. 



d) Selection of E2-specific recombinants. 

E2-specific recombinants could be identified by 
using the homology between TGEV and FIPV. Two 
restriction fragments from the E2 gene of FIPV 
were labelled with ^-dATP by means of nick 
translation and were used for the selection of the 
recombinants. Twenty-four recombinants hybridis- 
ed with probe 2, eight with probe 1 and two recom- 
binants hybridised with both probes. A restriction 
map of three recombinants was made. The recom- 
binants which represent the whole peplomer gene 
were used for sequence analysis. Restriction frag- 
ments were separated in an agarose gel and iso- 
lated by means of an NA45 membrane (Schleicher 
and Schuell) and cloned in M13 vectors (mp8 
and/or mp19). Sequence analysis was carried out 
according to the dideoxy-nucleotide termination 
method of Sanger et al. f Proc. Nath. Acad. Sci. 
USA 74, (1977), 721-732. Both the universal M13 
primer and E2-specific primers were used. The 
data were analysed by means of a DEC20/60 com- 
puter and the programs of Staden. Nucl. Acid. Res. 
10, (1982). 4731-4751. 



e) Nucleotide sequence. 

The nucleotide sequence of the peplomer gene 
of TGEV (Purdue strain) is shown in figure 1. The 

5 sequence comprises an open reading frame of 
4347 nucelotides with a coding capacity for a pre- 
cursor protein of 1449 amino acids (nucleotide po- 
sition 155-4502). Upstream of the ATG codon and 
downstream of first stop codon a recurring se- 

10 quence (ACTAAACT) is present This "irrtergenic" 
sequence also flanks the E2 gene of RPV and has 
also been found upstream of the nudeocapsid 
gene of TGEV and MHV. 

75 

f) Amino acid sequence. 

The ATG translation initiation codon (position 
161) is succeeded by 15 amino adds which prob- 

20 ably form the deavabie signal peptide, two hydro- 
philic amino adds succeeded by 13 hydrophobic 
amino acids. The extreme carboxy terminal part, 
position 4319-4393 (25 amino adds) comprises a 
hydrophobic region which presumable serves as a 

25 transmembrane anchor, both regions are under- 
lined in Figure 1. 



Claims 

30 

1. A gene which codes for protein of transmis- 
sible gastro-enteritis virus (TGEV), characterized by 
the nucelotide sequence as shown in Figure 1. 

2. A protein, characterized by the amino add 

55 sequence as shown in Figure 1, or a part thereof. 

3. An immunogen, characterized in that it com- 
prises a protein or peptide as claimed in Claim 2, 
whether or nor coupled to a carrier. 

4. A vacdne, characterized in that it comprises 
40 an immunogen as daimed in Clam 3. 

5. A method of preparing a protein or peptide 
as claimed in Claim 2, characterized in that 

a) the protein or peptide is synthesised in a 
manner known per su from individual amino adds 

45 and/or by coupling smaller peptides; or 

b) a host cell is transformed with an expres- 
sion vector which comprises a DNA coding for a 
protein or peptide as claimed in Claim 2, and the 
genome introduced in the expression vector is ex- 

so pressed. 

6. A DNA molecule which comprises a 
nucleotide sequence which codes for a protein or 
peptide as claimed in Claim 2, or which codes for a 
polypeptide which comprises a protein or peptide 

56 as daimed in Claim 2. 

7. A recombinant DNA expression vector which 
expresses the whole protein or peptide or a part 
thereof as defined in Claim 2, comprising an 
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operon with initiator sequences and terminator se- 
quence and a nucleotide sequence which codes for 
the said protein or peptide; the said nucleotide 
sequence being situated between the initiator se- 
quence and the terminator sequence of the operon. 5 

8. A host cell which comprises a recombinant 
DNA expression vector and/or DNA molecule as 
claimed in Claim 6 or 7. 

9. A method of preparing a DNA molecule 
which codes for a protein or peptide as claimed in 10 
Claim 2, characterized in that 

a) angle-stranded TGEV-RNA is isolated; 

b) a cDNA strand complementary to said 
RNA strand is synthesised; 

c) the RNA strand is removed and a second is 
strand of cDNA is synthesised with the first cDNA 

as a template. 
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*HT®KKLFVVLVVAPLIY6D 
TTAACACACCATG AAAAAACTATTTGTRnT.TTTGGTCGTftflTGnCATTnATTTATDr^ nA 
160 170 ISO 190 200 210 

NFP CSKLTNR T I GN QWNL I E 
CAATTTTCCTT6TTCTAAATTGACTAATAGAACTATAG6CAACCAGTGGAATCTCATTGA 
220 230 240 250 260 270 

TFLLNYSSRLPPNSDVV LGD 
AACCTTCCTTCTAAACTATAGTAGTAGGTTACCACCTAATTCAGATGTGGTGTTAGGTGA 
230 290 300 310 320 330 

YFPTVQPWFNCIRNNSNDLY 
TTATTTTCCTACTGTACAACCTTGGTTTAATTGCATTCGCAATAATAGTAAT6ACCTTTA 
340 350 360 370 330 390 

V TLENLKAVYUO t'ATENITU 
TGTTACACTGGAAAATCTTAAAGCAGTGTATTGGGATTATGCTACAGAAAATATCACTTG 
400 410 420 430 440 450 

NH'RQRLNVVVNGYPrsiTVT 
GAATCACAGACAACGGTTAAACGTAGTCGTTAATGGATACCCATACTCCATCACAGTTAC 
4 60 470 480 490 500 510 

TTRN FNSAEGAIICICKGSP 
AACAACCCGCAATTTTAATTCT6CTGAAGGTGCTATTATATGCATTTGTAAGGGCTCACC 
520 530 540 550 560 570 

PT TTTESSLT-CNUGSECRLN 
ACCTACTACCACCACAGAATCTAGTTTGACTTGCAATTGGGGTAGTGAGT6CAGGTTAAA 
530 590 . 600 610 620 630 

H K F P ICPSNSEA NCGNflL YG 
CTflTAAGTTCGCTATATGTCCTTCTAATTCAGAGGCAAATTGTGGTAATATGCTGTATGG 
640 650 660 670 680 690 

LQWFADEVVAYLHGASYRI S 
CCTACAATGGTTTGCAGATGAG6TT6TTGCTTATTTACATGGTGCTAGTTACCGTATTAG 
700 710 720 730 740 750 

FENGWSGTVTFGDttRATTLE 
TTTTGAAAATCAATGGTCTGGCACTGTCACATTTGGTGATATGCGTGCGACAACATTAGA 
760 770 780 790 800 810 
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VAGTLVDLWWFNPVYDVSYY 
AB.TCCGTU6CAC6CT-TBTA6ACETTTG6T6GTTTAATECTCTTTATBATB-TCAGTTATTA- 
820 830 840 S50 860 870 

•RVNNKNGTTVVSNCTDQCAS 
TAGGGTTAATAATAAAAATGGTACTACCGTAGTTTCCAATTGCACTGATCAATGTGCTAG 
880 890 900 910 920 930 



YVANVFTTQPGGFIPSDFSF 
TTATGTGGCTAATGTTTTTACTACACAGCCAGGAG6TTTTATACCATCAGATTTTAGTTT 
940 950 960 970 930 990 

MNWFLLTNS-STLVBGKLVTK 
TAATAATTGGTTCCTTCTAACTAATAGCTCCACGTTGGTTAGTGGTAAATTAGTTACCAA 

1000 1010 1020 1030 1040 IOjO 

QPLL VNC LWPyPSFEEAAST 
ACAGCCGTTATTAGTTAATTGCTTATGGCCAGTCCCTAGCTTTGAAGAAGCAGCTTCTAC 
1060 1070 1030 1090 1100 1110 

FCFEGAGFD'GCNGAVLNNTV 
ATTTTGTTTTGAGGGTGCTGGCTTTGATCAATGTAATGGTGCTGTTTTAAATAATACTGT 
1120 1130 1140 1150 1160 1170 

DVIRFNLNFTTNVQSGKGAT 
AGAC6TCATTAGGTTCAACCTTAATTTTACTACAAATGTACAATCAGGTAAGGGTGCCAC 
1180 U90 1200 1210 1220 1230 

VFSLNTTGGVTLEISCYTVS 
AGTGTTTTCATTGAACACAACGGGTGGTGTCACTCTTGAAATTTCATG i TATACAGTGAG 
1240 1250 1260 1270 1280 1<-'9U 

DSSFFSYGEIPFGVTDGPRY 
TGACTCGAGCTTTTTCAGTTACGGTGAAATTCCGTTCGGCGTAACTGATGGACCACGGTA 
1300 1310 1320 1330 1340 13oU 

cyvhyngtalkylgtlppsv 

CTGTTACGTACACTATAATGGCACAGCTCTTAAGTATTTAGGAACATTACCACCTAGTGT 
1360 1370 1380 1390 1400 1410 
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KEIAISKWGHFYINGYNFFc; 

CAAGGAGATTGCTATTAGTAAGTGGGGCCATTTTTATATTAATGGTTACAATTTCTTTAG 
1420 1430 1440 1450 1460 1470 

* TFPIDC ISFN|_TTGDSDUFIJ 
CACATTTCCTATTGATTGTATATCTTTTAATTTGACCACTGGTGATAGTGACGTTTTCTG 
1480 1490 1500 1510 1520 1530 

TIAYTSYTEALVQVENTATT 

GACAATAGCTTACACATCGTACACTGAAGCATTAGTACAAGTTGAAAACACA6CTATTAC 
1540 1550 1560 1570 1530 1590 

J5 VTYC N SHVN NIKCSQITAN 
AAAGGTGACGTATTGTAATAGTCACGTTAATAACATTAAATGCTCTCAAATTACTGCTAA 
1600 1610 1620 1630 U40 ifro 



N * G F Y P U 5 S 5 £ V C- l. V h k 5 \J 

TTTbAftTAATGGATTTTATCCTGTTTCTTCAAGTGAAGrTGGTCTTGTCAATAAGAGTGT 
1660 1670 1630 1690 1700 

VLLPSFYTH TIVNITIG LGM 
TGTGTTACTACCTAGCTTTTACACACATACCATTGTTAACATAACTATTGGTCTTGGTAT 
1720 1730 1740 1750 1760 1770 

KRSGYGQP IASTLSNITLPM 
GAAGCGTAGTGGTTATGGTCAACCCATAGCCTCAACATTAAGTAACATCACACTACCAAT 
1780 1790 1800 1810 1820 1830 

Q HNTOVYCIRSDQFSVYVH 
GCAGGATCACAACACCGATGTGTACTGTATTCGTTCTGACCAATTTTCAGTTTATGTTCA 
1840 1850 I860 1870 1880 1890 

STCKSA'LWDN IFKRNCTD»l_ 
TTCTACTTGCAAAAGTGCTTTATGGGACAATATTTTTAAGCGAAACTGCAC6GAC6TTTT 
1900 1910 1920 1930 1940 1950 

DAT AVIKTGTCPFSFDKLNN 
AGATGCCACAGCTGTTATAAAAACTGGTACTTGTCCTTTCTCATTTGATAAATTGAACAA 
I960 1970 1980 1990 2000 2010 

YLTFNXFCLSLSPVGANCKF 
TTACTTAACTTTTAACAAGTTCTGTTTGTC6TTGAGTCCTGTTGGTGCTAATTGTAAGTT 
2020 2030 2040 2050 2060 ? 0 70 
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OVAA RTRTNEQVVRSLYVIY 
' GATGTAGCTGCuCGTACAAGAACCAATGAGCAGGTTGTTAGAAGTTTGTATGTAATATA 
20S0 2090 2100 2110 2120 2130 

EEGDNIVGVPSDNSGVHDLS 

TGAAGAAGGAGACAACATAGTGGGTGTACCGTCTGATAATAGTGGTGTGCACGATTTGTC 
2140 2150 2160 2170 2180 2190 

VL HLDSCTDYNIYGR TGVGI 
AGTGCTACACCTAGATTCCTGCACAGATTACAATATATATGGTAGAACTGGTGTTGGTAT 
2200 2210 2220 2230 2240 2250 

IRGTNRTLLSGLYYTSLSGD 
TATTAGACAAACTAACA6GACGCTACTTAGTGGCTTATATTACACATCACTATCAGGTGA 
2260 2270 2280 2290 2300 2310 

LLGFKNVSDGVI YSVTPCDV 
TTTGTTAGGTTTTAAAAATGTTAGTGATGGTGTCATCTACTCTGTAACGCCATGTGATGT 
2320 2330 2340 2350 2360 2370 

SAQAAV IDGTIVGAITSINS 
AAGCGCACAAGCAGCTGTTATTGATGGTACCATAGTTGGGGCTATCACTTCCATTAACAG 
2380 2390 2400 24i0 2420 2430 

E L i-GL T H U T T T P N FYYY S TV 
! GAACTGTTftGGTC 7 M^C^C A rrGG^L'A^CrtftC/tCCTflfiTT TT TA T7ACTA'"TC If A TATA 
**4u ^50 • 2^60 2470 2480 ' 2490 



NYTNDRTRGTAIDSNfVFnrr 
TAATTACACAAATGATAGGACTCGTGGCACTGCAATTGA^ 



P VITYSNI GVCKNGAFUFtw 

ACCTGTCATAACCTATTCTAACATAGGTGTTTGTAAAAATGGTGCTTTTGTTTTTATTAA 
-•560 2570 2580 2590 2600 2610 

V THSOGDVQPiSTGNv-TIPT 

CGTCACACATTCTGATGGAGACGTGCAACCAATTAGCACTGGTAATGTCACGATACCTAC 
26^0 2630 2640 2650 2660 2670 

, NFT ISVQVEYIQVYTTPUQT 

AAACTTTACCATATCCGTGCAAGTCGAATATATTCAGGTTTACACTACACCAGTGTCAAT 
26w0 2690 2700 2710 2720 2730 
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D CSRYVCNGN P RCNKLLTQY 
AGACTGTTCAAGATATGTTTGTAATGGTAACCCTAGGTGTAACAAATTGTTAACACAATA 
2740 2750 2760 2770 2730 2790 



VSACQTIEQALAMGARLENfl 
CGTTTCTGCATGTCAAACTATTGAGCAAGCACTTGCAATGGGTGCCAGACTT6AAAACAT 
2800 2S10 2S20 2330 2840 2350 



E VDSMLFVS ENALKLASVEA 
GGAGGTTGATTCCATGTTGTTTGTTTCTGAAAATGCCCTTAAATTAGCATCTGTTGAAGC 
2360 2370 2330 2390 2900 2910 



FNSSETLDP IYKEWPNIGGS 
ATTCAATAGTTCAGAAACTTTAGACCCTATTTACAAAGAATGGCCTAATATAGGTGGTTC 
2920 2930 2940 2950 2960 2970 



WLEGLKYILPSHNSKRKYRS 
TTGGCTAGAAGGTCTAAAATACATACTTCCGTCCCATAATAGCAAACGTAAGTATCGTTC 
2930 2990 3000 3010 3020 3030 

AIEDLLFDKVVTSGLGTVDE 
AGCTATAGA6GACTTGCTTTTTGATAAGGTTGTAACATCTGGTTTAGGTACAGTTGATGA 
3040 3050 3060 3070 3030 3090 



DYKRCTGGYDIA DLVCAGYY 
AGATTATAAACGTTGTACAGGTGGTTATGACATAGCTGACTTAGTATGTGCTCAATACTA 
3100 3110 3120 3130 3140 3150 

NGIM VLPGVANADKMTflY TA 
TAATGGCATCATGGTGCTACCTGGTGTGGCTAATGCTGACAAAATGACTATGTACACAGC 
3160 3170 3130 3190 3200 3210 



8LAGGITLGALGGGAVAIPF 
ATCCCTTGCAGGTGGTATAACATTAGGTGCACTTGGTGGAGGCGCCGTGGCTATACCTTT 
3220 323Q 324Q 3260 3270 

AVAVQARLNYYALQTDVLNK 
TGCAGTAGCAGTTCAGGCTAGACTTAATTATGTTGCTCTACAAACTGATGTATTGAACAA 
3280 3290 3300 3310 3320 3330 

NQQ ILASAFNQAIGNITGSF 
AAACCAGCAGATTCTGGCTAGTGCTTTCAATCAAGCTATTGGTAACATTACACAGTCATT 
3340 3350 3360 3370 3380 3390 
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GKVNDA IHGRSRGLATVAK A 
T6GTAA6GTTAATGATGCTATACATCAAAGATCACGAG6TCTTGCTACT6TTBCTAAA6C 
3400 3410 3420 3430 3440 3450 



LAKVQDVVNIQGQALSHLT V 
ATTGGCAAAAGTGCAAGATGTTGTCAACATACAAGGGCAAGCTTTAAGCCACCTAACAGT 
3460 3470 3480 3490 3500 3510 



QLQNNFQAISSSISDIYNRL 
ACAATTGCAAAATAATTTCCAAGCCATTAGTAGTTCTATTA6TGACATTTACAATAGGCT 
3520 3530 3540 3550 3560 3570 



DELSADAQVDRLITGRLTAL 
TGACGAATTGAGTGCTGATGCACAAGTTGACAGGCTGATCACAGGAAGACTTACAGCACT 
3580 3590 3600 3610 3620 3630 



NAFVSGTLTRGAEVRASRQL 
TAATGCATTTGTGTCTCAGACTCTAACCAGACAAGCGGAGGTTAGG6CTAGTAGACAACT 
3640 3650 3660 3670 3630 3690 



AKDKVNECVRSQSQ RFGFCG 
TGCCAAAGACAAGGTTAATGAATGCGTTAGGTCTCAGTCTCAGAGATTCGGATTCTGTGG 
3700 3710 3720 3730 3740 3750 



NGTHLFSLANAAPNGMIFFH 
TAATGGTACACATTTGTTTTCACTCGCAAATGCAGCACCAAATGGCATGATTTTCTTTCA 
3760 3770 3730 3790 3800 38 10 



TVLLPTAYETVTAWPGICAS 
CACAGTGCTATTACCAACGGCTTATGAAACTGTGACTGCTTGGCCAGGTATTTGTGCTTC 
3820 3830 3840 3850 3860 3870 



DGDRTFGL'VVKDVQLTLFRN 
AGATGGTGATCGCACTTTTGGACTTGTCGTTAAAGAT6TCCAGTTGACTTTGTTTCGTAA 
3880 3890 3900 3910 3920 3930 



LD DK FYLTPRTfl YQPRVATS 
TCTAGATGACAAGTTCTATTTGACCCCCAGAACTATGTATCAGCCTAGAGTTGCAACTAG 
3940 3950 3960 3970 3980 3990 



SDFVaiEBCDVLFVMATVSDr 

TTCTGACTTTGTTCAAATTGAAGGGTGCGATGTGCTGTTTGTTAATGCAACTGTAAGTGA 
4000 4010 4020 4030 4040 4050 
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LP SIIPDYIDINQTVQDILE 
TTTGCCTAGTATTATACCTGATTATATTGATATTAATCAGACTGTTCAAGACATATTAGA 
4060 4070 4080 4090 4100 4110 



NFRPNWTV PELTFDIFNATY 
AAATTTTA6ACCAAATTGGACTGTACCTGAGTTGACATTTGACATTTTTAACGCAACCTA 
4120 4130 4140 4150 4160 4170 



LNLTGE IDDLEFRSEKLHNT 
TTTAAACCTGACTGGTGAAATTGATGACTTAGAATTTAGGTCAGAAAAGCTACATAACAC 
4180 4190 4200 4210 4220 4230 



TVELAILIDNINNTL VNLEW 
CACTGTAGAACTTGCCATTCTCATTGACAACATTAACAATACATTAGTCAATCTTGAATG 
4240 4250 4260 4270 4280 4290 



L N R IETYVKWP WYVULL I GL 
GCTCAATAGAATTGAAACCTATGTAAAATGGCCTTGGT ATGTGTGGCTACTAATAGGCTT 
4300 4310 4320 4330 4340 4350 



MVIFCIP LLLFC'CCSTGCCG 
AGTAGTAATATTTTGCATAnrftTTArT nr TATTTTf;rTr;TTr}TAriTArar!FiTTRrTRirriR 
4360 4370 4380 4390 4400 4410 



CIGCLGSCCHSICSRRQFEN 
ATGCATAGGTTGTTTAGGAAGTTGTTGTCACTCTATATGTAGTAGAAGACAATTTGAAAA 
4420 4430 4440 4450 4460 4470 

YEP IEKVHVh1*I*NVNSIIC 
TTACGAACCAATTGAAAAAGTGCACGTCCATTAAATTTAAAATGTTAATTCTATCATCTG 
44S0 4490 4500 4510 4520 4530 



Ynsscfc*rillrm«nkvfk 
ctataatagcagttgtttctgctagagaattttgttaaggatgatgaataaagtctttaa 

4540 4550 4560 4570 4580 4590 



N * T 
GAACTAAACT 
4600 
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